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Summary 

\ 

A  class  of  nonparametric  tests  based  on  the  third  quad¬ 
rant  layer  ranks  has  recently  been  studied  by  Woodworth 
in  connection  with  the  problem  of  testing  for  independence  in 
a  bivariate  distribution.  In  the  present  work,  exact  one-sided 
rejection  regions  are  tabulated  for  the  normal  score  layer  rank 
test  which  is  asymptotically  locally  most  powerful  for  positive 
dependence  in  the  bivariate  riormal  distribution.  The  cut-off 
points  are  tabulated  for  sample  sizes  n*4(l)9  and  significance 

^  Vo¬ 
le  ve  Is  a*. 10,  .05,  .025  and  .01.  Normal  and  Edgeworth  approxi¬ 
mations  for  the  significance  probabilities  are  also  given.  A 
simplified  version  of  the  normal  score  test  is  proposed  and  its 
rejection  regions  are  tabulated.  These  tests  are  compared  with 

the  correlation  coefficient  test,  Kendall's  t  test  and  Spear- 

< 

man's  rank  correlation  test  for  independence  by  means  of  Monte 
Carlo  evaluation  of  power  employing  10,000  trials  from  each  of 
three  different  types  of  bivariate  distributions.  Also  included 
is  a  brief  description  of  the  computing  aspects  of  the  problem 
that  may  prove  useful  in  similar  studies.  (  ) 
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1.  Introduction 

Let  £i“(Xi»Yi)»  i*l,2,...,n  be  a  random  sample  from  a  bi¬ 
variate  distribution  having  the  continuous  cumulative  distri¬ 
bution  function  (cdf)  F(x,y).  The  problem  of  testing  for  in¬ 
dependence  between  X  and  Y,  that  is,  of  testing  the  null  hy¬ 
pothesis  H  :F(x,y)*G (x)H(y)  against  the  alternatives  of  positive 
dependence  has  been  studied  extensively  in  the  literature.  The 
UMP  unbiased  test  under  the  bivariate  normal  model  is  based  on 

the  sample  correlation  coefficient  r.  Various  nonparametric 
« 

tests  have  also  been  proposed.  The  two  classical  ones  are  the 

tests  based  bn  Spearman's  rank  correlation  r  and  Kendall's  t. 

s 

A  unified  treatment  of  the  asymptotic  distribution  theory  of 
these  and  other  rank  order  tests  has  been  given  by  Hajek  and 
§idak  [6].  Lehmann  [10]  gave  some  mathematical  characterizations 

i 

of  positive  dependence  and  demonstrated  that  and  Kendall's  t 
possess  desirable  properties  for  specific  types  of  positive 
dependence . 

A  new  class  of  distribution-free  tests  of  independence 
based  upon  the  "3rd  quadrant  layer  ranks"  has  been  studied  by 
Woodworth  [13].  The  3rd  quadrant  layer  rank  of  is  defined 
as  the  number  of  points  (X^-X^/Y^-Y^)  ,  l<j£n,  that  lie  in  the 
closed  3rd  quadrant.  Let  X <X <» • ,<x(n)  and  set 
~(i)**X (i)  'Y[i] )  *  i*l»2,...,n  where  Y^j  is  the  Y-component  of 
the  vector  having  the  jth  smallest  X-component.  The  3rd  quad¬ 
rant  layer  rank  (henceforth  to  be  called  simply  "layer  rank") 
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£(j)  of  jg|j)  is  then  the  rank  of  Y^j  among  Y [i j  »Y [2]  '  *  *  *>Y  [  j  J  * 

The  class  of  layer  rank  statistics  considered  in  [13]  has 
n 

the  structure  T  »  T  E _(i(j),j),  where  E  (i,j),  l<i<j<n  is  a 
n  ^  n  n  ™  “  “ 

triple  sequence  of  constants  representing  the  weiqht  function 
associated  with  the  layer  ranks.  It  includes  Kendall's  t  as 
a  special  case  as  can  readily  be  seen  by  choosing  En(i,j)«i. 
Another  important  member  is  the  normal  score  layer  rank  statistic 


(1.1) 


n 

^  brtj°j,Uj) 


where  c^-E  (V^ )  with  vj1^<...<vj^  an  ordered  sample  of  si2e 
j  from  the  standard  normal  distribution  and 


(1.2)  bnj  -  f  0  for  j-1 

I'nj-lj-l)"1  Y  cni  for  2<j<n. 

It  is  shown  in  [13]  that  for  the  alternatives  of  positive  de¬ 
pendence  in  the  bivariate  normal  family,  the  test  which  rejects 
Hq  for  large  values  of  is  asymptotically  locally  most 

powerful  among  all  linear  layer  rank  tests. 

Although  the  asymptotic  properties  like  the  Pitman  and 
the  Bahadur  efficiencies  of  the  layer  rank  tests  were  studied 
in  [13] ,  no  table  of  significance  probabilities  was  given  to 
aid  in  carrying  out  the  tests  and  the  only  existing  tables  are 
for  Kendall's  t  statistic  [7,8].  In  Section  2,  we  provide  a 
table  of  upper  10%,  5%,  2.5%  and  1%  points  for  for  sample 
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sizes  n=4(l)9  as  well  as  the  normal  and  Edgeworth  approximations 
to  the  significance  probabilities.  The  handling  of  the  dis- 

0 

tribution  of  is  difficult  due  to  the  complicated  form  of 

sequence  {bnj).  Analogously  to  the  Kendall's  t  statistic,  a 
simplified  version  '  of  the  normal  score  layer  rank  test  is 
constructed  from  by  taking  the  bn^  identically  equal  to  1. 

Thus, 


(1.3) 


n 

jli  d3-‘<3>  • 


(2) 

A  table  of  upper  percentage  points  of  T^  '  is  also  provided  in 
Section  2. 

The  asymptotic  relative  efficiencies  of  the  tests  r,  r  ,  t 

S 

and  T^1^  have  been  studied  by  Konijn  [91,  Bhuchongkul  [21, 
Woodworth  [13]  and  others.  No  information  is,  however,  avail- 

i 

able  about  their  performance  in  small  and  moderate  sample  sizes. 

Computation  of  the  exact  power  being  extremely  difficult,  we 

present  in  Section  3  tables  of  the  empirical  power  of  all  these 
(2) 

tests  and  T '  under  three  bivariate  distributions.  From  the 
n 

enormous  number  of  samples  (10,000)  used  in  the  study,  one 
would  expect  that  the  empirical  values  are  fairly  close  to  the 
exact  power  for  the  alternatives  considered. 
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2.  Percentage  points  of  and  T^2^ 


Let  i*(t (1)  ,i  (2) , . . . ,1 (n) )  denote  the  vector  of  layer  ranks 

mm 

of  (Z . ,Z  . . .  ,Z  ,  . )  and  ^£(4)  the  set  of  all  possible  layer 
-li;  ~(zj  “'in; 

rank  vectors.  Under  H0,  the  different  components  of  t  are  mutually 

independent  and  t(j)  is  uniformly  distributed  over  the  set  of 

integers  {1,2,.. .,j},  l<.j£n  (cf.  Lemma  1.1  of  Brandoff-Nielsen 

and  Sobel  [1]).  Thus  the  nl  vectors  involved  in  the  set  *€(1) 

•* 

are  equally  likely  under  HQ.  .  Determination  of  the  significance 

points  of  a  layer  rank  statistic  TR  requires  generation  of  the 

set  *€(*),  calculation  of  the  values  of  T  over  the  set  end  then 
-  n 

the  ordering  of  these  values. 

Certain  techniques  have  made  it  possible  to  substantially 

reduce  the  computing  time.  Both  and  T^2*  are  handled 

simulteneously ,  thus  avoiding  regeneration  of  ^(i).  Secondly, 

n  ~ 

it  is  found  that  an  auxiliary  statistic  U_(i)»  T  l(j)  (this  is 

n  "  j-1 

equivalent  to  Kendall's  t)  helps  to  generate  the  layer  rank 
vectors  l  leading  to  the  high  values  of  T f ^  and  t[2*  due  to 
the  monotonic  nature  of  the  weight  functions  involved.  This 
is  made  precise  in  the  following  lemma. 

Lemma  2.1.  For  integers  r,  n<r£n(n+l)/2,  define  the  subsets 
S(r)  of  *€(!)  by 


(2.1) 


S(r)  -  U  :  n  (£)  -  r). 
-  n  - 


Then  for  i*l,2 
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(2.2) 


max  T?1*  (t)  >  max  T*iJ  U)  . 
IxS (r)  n  '  ieS(r-l) 


M  \  11 

Proof.  Consider#  in  particular,  Tn  '  U)  ■  ^1bnjcj  ,t  ( j)  *  For 

every  £QeS(r-l),  there  exists  a  layer  rank  vector  *0eS (r)  such 

that  for  some  integer  i  (l<i<n)  ,  i0(i)+l»i0(i)  and  *0( j) 

for  all  j-l,2,...,n  with  j+1.  The  sequence  (bn;J>  is  non-n. ;gative 

and  non-decreasing  in  j.  Also,  for  each  fixed  j,  the  sequence 

{c^}  is  increasing  in  i.  This  entails  T^1*  (*0)  <  T^  ^  (lQ) 

and  hence  (2.2)  follows  for  i-1.  The  same  argument  applies 

for  T^2^  and  this  completes  the  proof, 
n 

To  avoid  generating  all  the  nl  elements  of  'tf(t),  the 
layer  rank  vectors  are  generated  in  decreasing  order  of  Un(t). 

For  instance  with  n*5,  we  start  from  S (15) ■{ (1,2, 3,4,5) )  and 
then  generate  the  set  S (14)*{ (1,2, 3,4,4) ,  (1,2, 3, 3, 5),  (1,2, 2, 4, 5)  , 
(1,1, 3, 4, 5)}  followed  by  the  set  S(135  and  so  on.  •  For  each  n  and 
each  test  statistic  T^  ,  a  reference  number  C A  is  chosen  which 
is  sure  to  lie  below  the  10%  cut  off  point  (hence  also  below  the  5%, 
2.5%  and  1%  points).  The  sets  S(r)  are  generated  in  decreasing 

order  of  r  and  for  each  max  T ^  (1)  is  computed.  As  soon  as 

ieS(r)  n  - 
•• 

this  becomes  less  than  min(C^,C j) ,  the  program  is  terminated. 

(2.2)  ensures  that  the  set  of  all  the  layer  rank  vectors  gen¬ 
erated  under  this  scheme  contain  the  critical  regions  of  the 
desired  sizes. 

Even  with  the  above  technique,  more  than  nl/10  values  of 

T^  are  calculated  and  these  are  to  be  ordered.  The  best 
n 

ordering  program  requires  computing  time  proportional  to  NlogN, 
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where  N  numbers  are  being  ordered.  An  oven  more  serious  pro¬ 
blem  is  the  lack  of  sufficient  storage  space.  With  n=9,  for 
instance,  about  45,000  values  are  to  be  ordered  for  each 
statistic.  It  is  frequently  experienced  that  the  computation 
of  the  test  statistics  takes  much  less  computer  time  than  the 
printing  out  of  the  computed  values.  The  program  is  therefore 
run  in  two  stages.  In  the  first  stage,  the  computed  values  of 
each  statistic  are  grouped  into  a  histogram  having  1,000  cells. 
The  intervals  containing  the  'four  percentage  points  under  con¬ 
sideration  are  located.  In  the  final  run,  the  values  within 
these  four  intervals  are  ordered  and  printed  out  and  from  these 
the  exact  cut  off  points  are  determined.  About  15  minutes  were 
required  for  the  case  n*9. 

The  10%,  5%,  2.5%  and  1%  upper  cut  off  points  of  T*1*  and 
(2)  n 

T ...  for  n**4(l)9  are  presented  in  Table  1.  In  some  cases  the 

i 

nominal  a  can  only  be  attained  through  randomization  on  the 
boundary.  An  entry  within  braces  represents  the  nonrandomized 
significance  probability  (in  percentage)  corresponding  to  the 
cut  off  point  marked  by  an  asterisk  immediately  above  it.  Th'2 
number  immediately  below  is  the  next  lower  value  of  the  test 
statistic  which  is  to  be  randomized.  For  instance,  with  n*4, 
a=.10,  the  test  based  on  T ^  is  the  randomized  test:  reject 
Hq  with  probability  1  if  T^^ >1.8260  and  reject  HQ  with  pro¬ 
bability  (.10-1/12)  if  T^1** 


1.8122 


Table  1  The  upper  percentage  points  of  the  null  distributions  of  the  normal  score  layer  rank  statistics 
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In  order  to  investigate  the  accuracy  of  the  large  sample 
approximation,  we  evaluate  both  the  normal  approximation  and  the 
Edgeworth  expansion  containing  the  first  non  zero  correction  term. 
The  latter  expansion  assumes  the  form  (see  Cramer  [4]  pp.  88) 

t  r 

(2.3)  P(--  rg"T  <  x)  -  *(x)  +  -  *(4)<X)  +  0(n"2) 

W  nI2n  (4 1 ) 

where  ♦  is  the  standard  normal  cdf,  4^)  i8  its  fourth  derivative, 

o2(Tn)  is  the  variance  and  nr^n  is  the  vth  cumulant  of  under 

the  null  hypothesis.  By  symmetry  of  the  null  distribution  the 

odd  cumulants  are  zero  for  both  T ^  and  t[2*  .  For  T*1* ,  we  have 

n  n  n 


12) 

The  corresponding  expressions  for  T^  are  obtained  by  setting 
the  *>nj's  equ/*l  to  one.  The  results  appear  in  Table  2  and  they 
show  that  both  approximations  are  very  good  even  for  small 


samples 
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3.  Some  dependence  alternatives  and  empirical  power 


The  following  five  tests  of  independence  are  included  in 
the  present  study  for  the  evaluation  of  their  relative  perfor¬ 
mance  in  small  samples  under  specific  types  of  dependence. 

(i)  The  correlation  coefficient  r. 

a  ^  e  H  _  *  | 

(ii)  Spearman's  rank  correlation  r„  *  12(n  -n)~  £  (}-2ii)R. , 

8  j-1  2  3 

where  R^,...#Rn  are  the  ranks  of  Y [ij • • •  •  [nj • 

(iii)  Kendall's  statistic  ti  ( (?) l"1  5  [t ( j) - ( j+l)/2) . 

2  j-1 

(iv)  The  asymptotically  locally  most  powerful  normal  score 

layer  rank  test  T^  defined  by  (1.1)  and  (1.2). 

(2) 

(v)  A  simplified  normal  score  layer  rank  test  T'  7  defined  in 
(1.3). 

Three  sample  sizes  n*5,7,9  and  two  significance  levels 

a-. 05  and  .025  are  considered  for  each  test.  The  one-sided 

cut-off  points  of  T^  and  T^  are  obtained  from  Table  1, 

those  of  r  and  Kendall's  t  are  read  from  [8]  and  for  the  cut- 
s 

off  points  of  v,  we  use  the  fact  that  the  null  distribution  of 
(n-2) 1^2r/(l-r2) 1^2  under  normality  is  student's  t  with  (n-2) 
degrees  of  freedom. 

The  following  three  essentially  different  types  of  positive 
dependence  are  included  in  the  study. 

(a)  X  and  Y  have  the  bivariate  normal  distribution  with  positive 
correlation  coefficient  p.  The  tests  r  and  T^  are  optimal  for 
this  family  in  two  different  senses.  It  is  interesting  to  in¬ 
vestigate  the  comparison  of  the  two  as  well  as  the  manner  in 
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which  they  compare  with  the  other  three  tests  which  are  not 
optimal  for  this  family.  All  the  five  tests  are  location  and 
scale  invariant.  Without  loss  of  generality  we  take  the  means 
to  be  zero,  variances  arbitrary  and  p».l,  .3,  .5,  .7  and  .9  to 
cover  evenly  the  whole  range  of  positive  dependence. 

(b)  A  model  of  positive  dependence  which  often  arises  in  factor 
analysis: 

(3.1)  X  =  V+0Z 

Y  =  W+0Z 


where  V,  w  and  7.  are  independent  random  variables  and  0  is  the 
parameter  of  positive  dependence.  The  null  hypothesis  is  equi¬ 
valent  to  Hq:0«O.  This  model  was  considered  by  Bhuchongkul  [2) 
and  also  by  Hajek  and  Sidak  (6]  in  the  derivation  of  locally 
optimal  rank  tests  for  independence.  The  particular  case  of  V, 

W  and  Z  being  independent  uniform  random  variables  on  (0,1)  is 

treated  here.  The  correlation  coefficient  between  X  and  Y  is 
2  2 

p»0  /(1+0  )  >  0.  The  values  of  0  are  selected  to  yield  alter¬ 
natives  which  correspond  to  the  values  of  p  in  case  (a) . 

(c)  The  bivariate  exponential  distribution  introduced  by  Marshall 
and  Olkin  [11] .  It  is  essentially  a  three  parameter  family 
with  the  cdf  determined  from 

(3.2)  P(X>x,Y>y)  =  expI-Xj^-Xjy-X^max (x,y) ) , 

x,y>0,  X1>0,  X2>0'  *12-®’ 
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The  X  and  Y  marginals  are  univariate  exoonential  with  means 
(Aj+A^)”1  and  (*2+*12^-*  respectively  and  the  correlation  co¬ 
efficient  is  given  by  p*A^2 ^i+*2+^12^ ”**  Marshall  and  Olkin 
[11]  showed  that  the  distribution  (3.2)  arises  very  naturally 
in  certain  life  testing  situations.  It  has  an  interesting 
feature  in  that  although  the  marginals  have  exponential  distri¬ 
butions,  the  bivariate  distribution  has  a  singular  part  in 
addition  to  an  absolutely  continuous  part.  To  reduce  the  number 
of  parameters,  we  consider  the  case  of  identical  marginals 
A^“A2»Aq,  say.  Since  all  the  tests  are  scale  invariant,  their 
powers  would  depend  only  on  Q*^^2^^o  and  hence,  without  loss  of 
generality,  we  take  Aq»1  and  A12«0.  Then  we  have  p**0/(0+2). 

The  alternatives  0  are  again  chosen  so  as  to  have  the  same  values 
of  p  as  mentioned  in  (a)  . 

10,000  samples  are  generated  in  each  case  for  the  evaluation 

i 

of  empirical  power.  To  reduce  the  time  for  generating  the 
samples  and  also  to  provide  a  possibly  better  basis  for  comparison, 
nine  pairs  of  observations  (X^,Y^)  are  generated  each  time.  The 
first  five  are  used  for  the  case  n-5,  the  first  seven  for  the 
case  n«7  and  all  nine  for  the  case  n-9.  As  a  preliminary  to 
generating  the  (X,Y)  observations,  independent  uniform  (0,1) 
random  numbers  are  first  generated  following  the  scheme  dis¬ 
cussed  in  Moshman  [12] .  Appropriate  transformations  are  then 
used  to  convert  to  the  distributions  specified  in  (a) ,  (b)  and 
(c) .  Specifically,  let  Z^,  z2  and  be  three  independent 
uniform  (0,1)  random  variables  and  set 


(3.2)  X(a) 
Y(a) 

(3.3)  X(b) 

Y  (b) 


(-21n7<1)1/2  sin(2TT7.2)+D(l-p2)"1/2(-21nZ1)1/2cos(2TrZ2) 
(-21nZ1) 1/2  cos  { 2ttZ2 )  , 

z1+[p/(i-p)]1/2z3 

Z2+(p/(l-p) ]1/2Z3 


and 

(3.4)  X(c)  =  minl-lnd-Z^  ,  - (1-p) / (2p) In (1-Z3) ] 
Y(c)  *  min(-ln(l-Z2) #  - (1-p)/ (2p) In (1-Z31  . 


It  is  easy  to  verify  that  (X^,  Y^)  is  a  bivariate 
normal,  (X^,  Y^)  conforms  to  the  model  (b)  ,  (X*c^,  Y^)  has 
a  bivariate  exponential  distribution  and  the  correlation  in 
each  case  is  p.  The  transformation  (3.2)  is  due  to  Pox  and 
Muller  (31  and  (3.4)  follows  from  the  results  in  [11]. 

Independent  sets  of  uniform  random  numbers  are  used  for 
each  of  the  three  different  models.  However  for  each  model, 
the  same  set  of  uniform  random  numbers  are  used  to  generate 
each  set  of  the  (X,Y)  observations  corresponding  to  an  alter¬ 
native  p.  The  observations  in  any  set  are  obtained  by  merely 
changing  the  value  of  p  in  the  transformations  introduced  above. 
The  number  of  rejections  of  HQ  out  of  a  total  of  10,000  samples, 
are  presented  in  Table  3  for  model  (a) ,  Table  4  for  model  (b) 
and  Table  5  for  model  (c) .  Hence  the  entries  in  these  tables 
need  only  be  divided  by  10,000  to  yield  the  estimated  power. 
Finally,  the  number  of  rejections  are  also  given  for  p*0  which 
is  equivalent  to  the  hypothesis  of  independence  in  each  model. 


Table  3.  Number  of  rejections  out  of  10,000  trials  under  dependence  alternatives  in  model  (a). 


16 


Eg 

in 

xp 

CO 

rH 

H 

rH 

IN 

CM 

XT 

XT 

rH 

n1 

VO 

xp 

© 

cn 

© 

rH 

O 

(M 

cn 

CM 

in 

P* 

rH 

Oi 

p 

rH 

© 

cn 

9 

p* 

© 

cx 

00 

xp 

o 

CM 

rH 

px 

CM 

cn 

© 

CM 

cn 

o 

I 

ID 

00 

Ci 

cn 

P* 

c 

cx 

00 

*P 

rx 

Oi 

px 

© 

■ 

m 

O 

CO 

cn 

vo 

00 

00 

«0 

m 

CM 

VO 

rH 

cn 

VO 

© 

xp 

CM 

r> 

o 

«p 

m 

p- 

© 

© 

VO 

CM 

rH 

p- 

CM 

VO 

xp 

■ 

px 

VO 

CM 

c 

in 

c 

Cl 

•9 

VO 

c 

in 

CP 

© 

xp 

© 

CM 

xp 

VO 

H 

cn 

<P 

-C 

cn 

rH 

cn 

xp 

rH 

cn 

xp 

© 

CN 

VO 

00 

c 

00 

in 

C 

XI 

cn 

in 

tx 

CM 

** 

px 

CM 

rH 

O 

© 

o 

o 

c 

o 

m 

cn 

VO 

CM 

vo 

o 

CM 

© 

© 

|  % 

• 

cn 

rH 

c 

c 

p* 

cn 

o 

VO 

rH 

o 

vo 

xp 

© 

vo 

cn 

H 

CM 

CM 

rH 

CM 

H 

rH 

CN 

rH 

rH 

CM 

rH 

rH 

CM 

It 

a 

0 

cn 

VO 

rH 

o 

Ci 

XT 

03 

rx 

00 

CJ 

CX 

© 

© 

cn 

cn 

CN 

o 

in 

rH 

rH 

p> 

C 

in 

vo 

P* 

cn 

VO 

00 

rH 

• 

VO 

© 

CM 

in 

CO 

o- 

in 

r* 

c 

in 

px 

o 

© 

px 

o 

rH 

H 

rH 

rH 

rH 

cn 

m 

rH 

vo 

o 

00 

» 

in 

in 

CM 

P» 

«P 

CM 

© 

© 

• 

rH 

r~ 

CM 

c 

px 

o 

N 

00 

rH 

in 

O 

rH 

SO 

© 

cn 

n 

«P 

CM 

in 

xr 

cn 

cn 

xr 

m 

cn 

M* 

cn 

cn 

xr 

xr 

m 

vo 

rH 

f 

oo 

CM 

vo 

p- 

in 

rH 

|x 

o 

cn 

o 

CN 

cn 

CM 

© 

* 

in 

vo 

cn 

cn 

fx 

cn 

xp 

© 

CM 

CM 

CM 

CM 

CM 

CM 

N 

CM 

CM 

CM 

CM 

CM 

CM 

CM 

CM 

_ 

09 

XT 

xp 

c 

cn 

r» 

© 

cn 

in 

CM 

in 

rH 

SO 

rH 

© 

p* 

Cl 

r- 

vo 

vO 

m 

0 

CM 

in 

VO 

vo 

o 

px 

CM 

• 

o 

xr 

00 

p* 

in 

in 

N 

cn 

cn 

cn 

in 

in 

cn 

© 

00 

Cl 

c 

in 

00 

c 

© 

00 

c 

vo 

00 

CM 

V£ 

CO 

© 

CM 

rH 

o 

o 

cn 

xr 

c 

in 

vo 

00 

in 

© 

xp 

© 

cn 

p- 

o 

00 

cn 

CM 

cn 

H 

in 

o 

xp 

o 

CM 

«p 

[X 

XP 

© 

• 

<N 

rH 

cn 

o 

<P 

CM 

00 

o 

cn 

© 

xf 

CM 

© 

cn 

VO 

r* 

cn 

in 

VO 

cn 

M* 

vo 

cn 

in 

VO 

cn 

xp 

© 

O 

• 

cn 

xr 

00 

vo 

in 

in 

p* 

00 

rH 

X* 

00 

in 

CM 

CM 

in 

© 

© 

c 

rH 

XT 

cn 

r* 

CM 

m 

P- 

in 

fx 

px 

H 

O 

N 

Q 

• 

IN 

cn 

rH 

Ci 

rx 

m 

00 

VO 

cn 

00 

cx 

in 

00 

px 

© 

CM 

cn 

xp 

H 

CM 

m 

H 

CM 

cn 

rH 

CM 

cn 

H 

CM 

cn 

0 

in 

xp 

m 

o 

rH 

XT 

m 

00 

cn 

00 

r*» 

rH 

vo 

© 

© 

cn 

cm 

PO 

<r 

CM 

rH 

px 

o 

OO 

00 

O' 

cn 

00 

© 

H 

Xp 

• 

CM 

vo 

o 

H 

P* 

rH 

cn 

vo 

o 

XT 

px 

o 

XP 

rx 

H 

rH 

CM 

rH 

H 

H 

H 

rH 

rH 

rH 

H 

rH 

H 

H 

rH 

CM 

p- 

00 

vo 

cn 

O 

H 

c 

vo 

OV 

cn 

in 

o 

© 

CM 

H 

in 

CM 

»P 

m 

cn 

00 

CM 

vo 

CM 

rH 

rx 

cn 

H 

rx 

• 

io 

r* 

00 

vo 

px 

rx 

10 

P» 

r 

VO 

p* 

p* 

vo 

px 

px 

VO 

CM 

rH 

m 

VO 

in 

CM 

vo 

VO 

in 

© 

© 

00 

© 

o 

in 

00 

rH 

tx 

rH 

o 

p* 

CM 

rH 

r- 

© 

© 

p* 

o 

© 

xp 

xr 

in 

<< 

in 

in 

in 

m 

»P 

Xp 

© 

C 

in 

p*» 

c 

in 

rx 

Cl 

in 

r» 

c 

in 

P* 

© 

© 

P* 

© 

4J 

M 

rH 

CM 

(0 

0) 

u 

M 

•p 

Eh 

E« 

Test 


17 


Ov 

rH 

ON 

on 

Ov 

O 

on 

0» 

fx 

Ov 

OV 

o 

ON 

VO 

on 

© 

H 

on 

OV 

rH 

on 

O 

ix 

ON 

OV 

VO 

ON 

VO 

rx 

^H 

ON 

00 

rH 

vo 

on 

rH 

ON 

fx 

CO 

r* 

OV 

r» 

OV 

vo 

vo 

Ov 

t" 

vo 

Ov 

© 

on 

© 

© 

o 

© 

in 

ON 

o 

© 

r- 

on 

ON 

r-' 

vo 

o 

fx 

fx 

on 

• 

rH 

cr. 

VO 

O' 

in 

rH 

© 

xr 

o 

Ov 

in 

ON 

rH 

rH 

ON 

rH 

ON 

rH 

ON 

rH 

ON 

rH 

m 

rH 

© 

o 

OV 

ON 

o 

ON 

VO 

ix 

© 

on 

O 

in 

vo 

VO 

00 

xr 

rx 

© 

in 

vo 

ro 

• 

VO 

00 

o 

in 

0" 

Ov 

© 

p» 

© 

in 

fx 

© 

rH 

H 

ON 

© 

ON 

00 

r~ 

rH 

ON 

rH 

on 

o 

in 

rH 

on 

ov 

on 

rH 

00 

xr 

ON 

rx 

o 

ON 

VO 

ON 

■ 

ro 

on 

xj" 

on 

on 

on 

on 

"V 

on 

on 

«T 

ro 

ON 

00 

ro 

© 

rH 

rH 

ON 

00 

rH 

in 

in 

00 

in 

rH 

00 

on 

© 

rH 

on 

in 

rH 

on 

VO 

rH 

o 

in 

VO 

on 

m 

r* 

© 

o 

X9* 

© 

ON 

VO 

rH 

on 

on 

XT 

©  rH  rH 

in  vo  vo 

N  (M  <N 


VO  xj<  in 

xr  fx  in 

n  n  rg 


m  r»  « 
©  ©  © 
N  N  N 


00  TT  |> 

in  vo  in 

N  IN  IN 


00  N  H 
in  vo  in 
in  n  N 


ON 

rH 

on 

00 

ON 

vo 

xr 

1" 

© 

00 

© 

© 

© 

00 

rH 

xr 

vo 

ON 

vo 

vo 

VO 

00 

© 

m  n>  rH 

on  ©  p* 

on  m  xr 

vo  oo  © 


in 

rH 

ON 

on 

on 

on 

© 

ix 

vo 

© 

© 

© 

© 

on 

© 

rH 

on 

o 

ON 

rx 

xr 

VO 

fx 

in 

» 

rH 

in 

© 

VO 

rH 

« 

xr 

ro 

vo 

on 

VO 

in 

in 

ON 

© 

rH 

VO 

© 

on 

xr 

in 

on 

vo 

o 

rH 

© 

ON 

rH 

rx 

on 

in 

© 

|X 

ON 

© 

in 

© 

© 

ON 

© 

© 

© 

© 

© 

on  vo  r* 
in  on  o 
,  iH  ov  in 
on  xt>  vo 


on 

© 

© 

© 

© 

on 

© 

O 

© 

ON 

on 

on 

on 

xr 

m 

|X> 

rx 

© 

rx 

© 

on 

rH 

ON 

on 

ON 

© 

xr 

© 

ON 

o 

p* 

© 

© 

rH 

ON 

on 

ro 

ON 

on 

r* 

© 

on 

© 

on 

rH 

ON 

<n 

r* 

ON 

© 

ON 

rH 

« 

rH 

© 

rx 

o 

rH 

on 

r* 

© 

ON 

o 

on 

© 

ON 

© 

m 

© 

on 

© 

o 

on 

© 

r» 

© 

© 

© 

© 

rH 

o 

on 

© 

ro 

rH 

© 

© 

© 

O 

o 

on 

© 

on 

rx 

on 

© 

© 

© 

© 

r* 

rx 

« 

rH 

© 

O 

O 

xr 

© 

© 

© 

rx 

© 

rH 

© 

o 

xr 

© 

© 

fx. 

rx 

o 

(H  on  on 
in  m  in 


in  r»  © 


© 

on 

© 

ON 

ON 

rH 

ro 

O 

© 

ON 

rH 

© 

on 

(N 

rx 

ON 

ON 

|X 

© 

r* 

rx 

© 

rx 

rx 

© 

ix 

IX 

rx 

ON 

xr 

© 

© 

© 

(N 

O 

ro 

o 

xr 

on 

©  rH  O 

rH 

ON 

o 

© 

© 

© 

© 

© 

© 

© 

© 

© 

© 

rx 

© 

© 

tx 

© 

© 

fx 

© 

ON 

Ex 


in 

M 


.Si£-  •  «=jWu»'3S**b  .&■ 


Table  5.  Number  of  rejections  out  ot  10,000  trials  under  dependence  alternatives  in  model  (c) . 
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4.  Remarks  and  Conclusions 

It  was  felt  that  the  generated  data  were  quite  reliable  since 
the  histograms  of  the  univariate  marginal  observations  were  well 
within  the  limits  of  the  goodness  of  fit  tests  in  each  case. 
Moreover,  to  check  the  extent  of  internal  variability  of  the  re¬ 
sults,  the  program  was  run  in  ten  groups  of  1,000  samples  each. 

The  variation  of  the  results  of  empirical  power  from  group  to 

group  was  found  to  be  quite  small.  This  observation  enhances 

# 

the  reliability  of  the  pooled  empirical  power  presented  in  the 
tables  of  Section  3.  Further,  the  results  for  r  were  compared 
with  selected  values  of  the  exact  power  under  normal  alternatives 
(see  David  [5])  and  were  found  to  agree  within  +  .01. 

Among  the  five  tests  included  in  the  study,  all  but  the 
correlation  coefficient  test  r  are  distribution- free.  As  men¬ 
tioned  earlier,  the  cut-off  points  for  r  were  takdn  from  student's 
t  distribution  which  holds  only  under  the  bivariate  normal  model. 
The  empirical  significance  probabilities  for  r  in  Table  3  are 
as  close  to  the  nominal  levels  as  they  should  be.  Curiously 
enough,  they  also  tend  to  agree  very  closely  in  Table  4.  The 
underlying  model  in  this  case  is  the  uniform  distribution  of 
(X ,Y)  on  the  unit  square.  Thus,  the  significence  levels  for 
the  parametric  test  r  seem  to  be  robust  with  respect  to  the 
uniform  distribution.  However,  this  property  does  not  hold  for 
the  bivariate  exponential  model.  This  is  evident  as  one  ex¬ 
amines  the  empirical  levels  of  r  in  Table  5. 


20  ‘  * 


Finally,  we  make  a  comparative  study  of  the  empirical  powers 
of  the  five  tests  under  consideration.  Regarding  the  bivariate 
normal  model  (a)  ,  there  is  clear  indication  that  r  has  substantially 
higher  power  than  all  the  other  tests  (see  Table  3) .  r  is  even 

noticeably  better  than  the  optimal  layer  rank  test  T^,  al¬ 
though  asymptotically  has  Pitman  efficiency  1  relative  to  r. 

The  r  test  also  seems  to  be  superior  to  all  the  others  for  the 

model  (b) .  However,  for  the  bivariate  exponential  alternatives, 

$ 

not  only  are  its  significance  levels  seriously  distorted,  but 

its  power  tends  to  lag  behind  those  of  all  other  tests.  Thus 

among  the  five  tests,  the  r  test  is  the  best  for  the  models  (a) 

and  (b) ,  but  it  is  the  poorest  for  model  (c) .  The  simplified 

12\ 

normal  score  layer  rank  test  T  ,  on  the  other  hand,  seems  to 
have  the  highest  power  among  all  five  for  the  model  (c)  ,  al¬ 
though  it  has  the  lowest  powers  for  (a)  and  (b) .  The  difference 
among  the  performances  of  T^  ,  r  and  t  are  somewhat  less  pro- 

9 

nounced.  Asymptotically,  r  and  t  have  relative  efficiency  1 

for  all  parent  bivariate  distributions  and  their  efficiency  re- 
#1)  2 

lative  to  T  is  (3/ir)  *>.912  for  the  bivariate  normal  model 

(a) .  In  Table  3,  their  empirical  powers  are  found  to  be  quite 
close  to  one  another  for  all  the  alternatives  p.  For  the  model 

(b)  ,  the  Spearman  test  r  seems  to  have  more  power  than  both 

S 

T ^  and  t.  The  difference,  though  moderate,  is  noticeable  for 
all  n,  all  alternatives  p  and  for  both  the  values  of  a.  On 
the  other  hand,  r  has  slightly  lower  power  than  T^  and  t 
for  the  model  (c) . 


xavwnwiaaijWi  i  mi 1 1  .iw  quwfflswwH 


21 


References 


[1]  Barndorff-Nielsen,  0.  and  Sobel,  M.  "On  the  distribution  of 
the  number  of  admissible  points  in  a  vector  random  sample" , 
Theory  of  probability  and  its  applications ,  11  (1966) 

No.  2,  249-269. 

[2]  Bhuchongkul,  S.  "A  class  of  nonparametric  tests  for  inde¬ 
pendence  in  bivariate  populations",  Ann.  Math.  Statist., 

35  (1964),  138-149. 

•m  «** 

[3]  Box,  G.E.P.  and  Muller,  M.  "A  note  on  the  generation  of 
random  normal  deviates",  Ann.  Math.  Statist.,  29  (1958), 
610-611.  •* 

[4]  Cramer,  H.  Random  variables  and  probability  distributions. 
(2nd.  ed.)  (1962)  Cambridge  Univ.  Press,  London. 

[5]  David,  F.N.  Tables  of  the  ordinates  and  probability  in¬ 
tegral  of  the"  distribution  of  the  correlation  coefficient 
in  small  samples'!  Biometrika  (1938)  ,  London. 

[61  Hajek,  J.  and  Sidak,  Z.  Theory  of  rank  tests.  (1967) , 
Academic  Press,  New  York. 

[7]  Kaarsemaker,  L.  and  Van  Wijngaarden,  A.  "Tables  for  use  in 

rank  correlation",  Statistica  Neerlandica,  7  (1953),  41-54. 

'  1  """  "  “ 

[8]  Kendall,  M.G.  Rank  correlation  methods.  (3rd  ed.)  (1962), 
Griffin  ft  Co. ,  London. 

[9]  Konijn,  H.S.  "On  the  power  of  certain  tests  for  independence 
in  bivariate  populations",  Ann.  Math.  Statist.,  27  (1956), 
300-323. 

[101  Lehmann,  E.L.  "Some  concepts  of  dependence",  Ann.  Math. 
Statist.,  37  (1966),  1137-1153.  _ 

[11]  Marshall,  A.W.  and  Olkin,  I.  "A  multivariate  exponential 
distribution".  Jour.  Amer.  Stat.  Assoc.,  62  (1967),  30-44. 

[12]  Moshman,  J.  "The  generation  of  pseudo- random  numbers  on  a 
decimal  calculator" ,  Jour,  of  the  Assoc,  for  Computing 
Machinery ,  1  (1954)  ,  88-91. 

[13]  Woodworth,  G.G.  "On  the  asymptotic  theory  of  tests  of  in¬ 

dependence  based  on  bivariate  layer  ranks".  Tech.  Report 
No.  75,  (1966)  ,  Univ.  of  Minnesota.  ^ 


